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Abstract. We exhibit proofs of Furstenberg's Multiple Recurrence Theorem 
and of a special case of Furstenberg and Katznelson's multidimensional version 
of this theorem using an analog of the density-increment argument of Roth and 
Gowers. The second of these results requires also an analog of some recent 
finitary work by Shkredov. 

Many proofs of these multiple recurrence theorems are already known, but 
the approach of this paper sheds some further light on the well-known heuristic 
correspondence between the ergodic-theoretic and combinatorial aspects of mul- 
tiple recurrence and Szemeredi's Theorem. Focusing on the density-increment 
strategy highlights several close points of connection between these settings. 
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1. Introduction 

In 1975 Szemeredi published the first proof of a long-standing conjecture of 
Erdos and Turan concerning arithmetic progressions in dense arithmetic sets. 
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Theorem 1,1 (Szemeredi's Theorem). If E C Z admits some 6 > for which 
there are arbitrarily long intervals [M, N] with 

\Elr^[M,N\ \ > 5{N- M) 

(that is, E has 'upper Banach density' equal to at least 6), then E also contains 
for every k > 1 a nondegenerate arithmetic progression of length k: 

E D {a, a + n, a + 2n, . . . ,a + (k — l)n} for some a £ Z,n > 1. 

Separate proofs for various special cases were given earlier by Roth and by 
Szemeredi himself, and now the thirty-five years subsequent to Szemeredi's break- 
through have seen the emergence of a host of alternative approaches to this theorem 
and several generalizations. 

The many techniques that have been brought to bear in this investigation are 
loosely drawn from three areas of mathematics: 

• graph and hypergraph theory (in work of Szemeredi, Solymosi, Nagle, 
Rodl, Schacht, Skokan, Gowers and others), 

• ergodic theory (largely building on ideas of Furstenberg and Katznelson), 

• harmonic analysis (following Roth, Bourgain, Gowers, Green, Tao and 
Shkredov). 

The alternative arguments constructed from these three bodies of theory sometimes 
correspond much more closely than is initially apparent, owing to many differences 
in technical detail that turn out to be quite superficial. No really comprehensive 
overview of the relations among these approaches is yet available, but fragments 
of the picture can be found in the papers UKraO?! IGT07I ITao06ai and in Chapters 
10 and 1 1 of Tao and Vu's book fTYW\. 

The purpose of the present note is to extract one aspect of the harmonic analytic 
approach — the 'density-increment argument', originating in the early work of 
Roth IIRot53l — and present a natural analog of it in the rather different setting of 
ergodic theory. No new theorems will be proved except for some technical results 
needed on route, but I hope that this alternative presentation of existing ideas will 
contribute to enhancing the toolkits of those working on this class of problems, 
and also shed some light on the open questions that remain concerning the density- 
increment approach. 

1.1. Ergodic Ramsey Theory. Two years after Szemeredi's proof of Theorem 11.11 
appeared, Furstenberg offered in IFur771 a very different approach to the same 
result based on a conversion to a problem in ergodic theory, using what is now 
referred to as 'Furstenberg 's correspondence principle'. 

A precise formulation of the general correspondence principle can be found, for 
example, in Bergelson [Ber96|. Here we simply recall that Furstenberg proved the 
equivalence of Szemeredi's Theorem to the following: 

Theorem 1.2 (Multiple Recurrence Theorem). IfT : Z r\ {X, fi) is a probability- 
preserving action on a standard Borel probability space and A <Z X is measurable 
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and has IJ,{A) > 0, then also 

1 ^ 

lim inf — y ^(A n T"" A n • • • n T-^'^-^)" A) > VA: > 1. 

71=1 

Furstenberg's proof of Theorem 11.2 I relied on a powerful structural classification 
of probability-preserving dynamical systems developed independently by Fursten- 
berg and by Zimmer ( IIZim76bl IZim76all ). 

Shortly after that proof appeared, Furstenberg and Katznelson realized that only 
a modest adaptation yields a significantly stronger result. 

Theorem 1.3 (Multidimensional Multiple Recurrence Theorem). TfTi, T2, . . . , T^^ : 

Z nv (X, [i) are commuting probability-preserving actions on a standard Borel 
probability space and A C X has ij-{A) > then also 

1 ^ 

lim inf — V n Tr"A n • • • n T7"A) > o. 

71=1 

This appeared in IIFK78n . Theorem 11.21 follows from Theorem 11.31 by setting 
d := k — I and Tj := for i < — 1. On the other hand, Theorem 1 1.3 1 also has a 
combinatorial consequence that strengthens Szemeredi's Theorem: 

Theorem 1.4. If E C Z*^ admits some 5 > for which there are cuboids ni<d[-^i' 
with minj<^ \Ni — Mi \ arbitrarily large and 



EnYl[Mi,N,] >6l[{N,-Mi 



i<d i<d 

then E also contains the set of vertices of a nondegenerate upright right-angled 
isosceles simplex: 

E D {a, a + nei, . . . , a + ne^} for some a G WJ^ , n > 1. 

Interestingly, this result went unproven by purely combinatorial means until the 
development of hypergraph analogs of Szemeredi's famous Regularity Lemma by 
Nagle, Rodl and Schacht IINRS06L Gowers HGowOTII and IITao06bL working than 
twenty years later. In addition, several other purely combinatorial assertions have 
now been accessed though 'Ergodic Ramsey Theory', the subject that emerged 
from Furstenberg and Katznelson's early developments, including a density version 
of the Hales-Jewett Theorem IIFK91I1 and a density Ramsey Theorem for subtrees 
of trees IIFW031 . 

Within ergodic theory, a great deal of energy has now been spent on obtaining 
the most precise possible understanding of the averages whose limit infima are the 
subjects of Theorems ll.2l and ll.3[ we will return to some of these developments 
later. 

1.2. The density-increment argument. The 'density-increment argument' was 
first used by Roth for his early proof of the case /c = 3 of Theorem 11.11 Much 
more recently, Gowers developed in IIGow98l IGowOU an extremely sophisticated 
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extension of Roth's approach, and using this was able to give a density-increment 
proof of the full Szemeredi Theorem. 

We will not spend time here on the many technical accomplishments involved 
in Gowers' work, requiring a call to tools from yet other parts of arithmetic combi- 
natorics such as Freiman's Theorem. Rather we record just a simple statement of 
the density-increment proposition that Ues at its heart. 

Proposition 1.5. Suppose that 6 > 0, that N is sufficiently large and that E C 
{1, 2, . . . , N} has \E\ > 5N but contains no k-term arithmetic progression. Then 

there is an arithmetic progression P C {1, 2, . . . , iV} ofsiz.e at least iV({^/2)'' )' 
such that 

\EV\P\> (<5 + ((V2)''^')^''^')|i'|. 

□ 

This proposition is implicit in HGowOU . but does not appeal" in the above form 
because Gowers presents his argument in terms of the crucial auxiliary notion of 
'higher-degree uniformity', and splits the above result into several pieces that are 
connected via this auxiliary notion. 

This kind of uniformity is defined in terms of the Gowers uniformity norms 
(Section 3 of MGowOll : see also Chapter 1 1 of Tao and Vu IITV06I ) that have since 
become widely used in additive combinatorics. Uniformity of degree 1 can be 
described simply in terms of the presence of some large values among the Fourier 
coefficients of l^;, regarded as a function on the group 'L/NZ,; this is essentially 
the notion that Roth uses in his approach for = 3. Higher-degree uniformity 
extends this property, although it is not so easily described using Fourier analysis. 
In his more general setting, Gowers proves on the one hand that if E is sufficiently 
uniform of degree k — 2 then it contains a fc-term arithmetic progression (Corollary 
3.6 in MGowOll ). and on the other that if E is not sufficiently uniform of degree 
k — 2 then we may partition {1,2,..., N} into long arithmetic subprogressions 
such that E has a relative density inside some of these subprogressions that is 
substantially larger than 5 (Theorem 18.1 in HGowOll ). This fact can then be used 
to pick out one such subprogression satisfying the above conclusion (Lemma 5.15 
in MGowOll ). Proposition 11.51 amounts to the conjunction of these facts. 

Our proof of Theorem 1 1 . 2 I belo w takes a similar form (although we should stress 
that our task is much simpler than Gowers'), using an ergodic -theoretic analog of 
the notion of 'uniformity' arising in work of Host and Kra. Similarly to the presen- 
tation in IGowOU, we will find that handling the consequences of non-uniformity 
is the more compUcated of the two steps involved. 

From Proposition 11.5 1 a proof of Szemeredi's Theorem follows quickly by con- 
tradiction. If is a counterexample of density 5 and N is sufficiently large, then 
for a subprogression P as given by Proposition 11.51 we see that E n P, identified 
with a subset of {1, 2, . . . , |P|} by the obvious affine map, is another counterex- 
ample with density that exceeds 5 by an amount depending only on 6 and k. It 
is contained in a discrete interval of length |P| = N'^^^''^^ for some small fixed 
K,{6, k) > 0. Therefore, provided was sufficiently large to begin with, iterating 
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this construction must eventually turn a counterexample of density at least 6 into a 
counterexample of density greater than 1: an obvious contradiction. 

In addition to its aesthetic value, Gowers' new proof of Szemeredi's Theorem 
gives much the best known bound on how large N must be taken in order that 
a /c-term arithmetic progression is certain to be found in a density-(5 subset E C 
{1,2,..., A^}. In view of this, it was natural to ask whether this approach could 
also be brought to bear on the multidimensional Theorem 11.41 in order to give a 
similarly striking improvement to the bounds available there. Gowers poses this 
problem explicitly and offers some discussion of it in his survey HGowOOl . Recently 
Shkredov has made the first serious progress on this problem by essentially solving 
the case d = 2m [ Shk06bi . applying some important new technical ideas that are 
needed to prove and then use a relative of Proposition 11.51 However, a further 
enhancement of these ideas that will yield a density-increment proof of the full 
Theorem 1 1.41 with or without improved bounds, still seems relatively distant. 



1.3. Outline of this note. The centrepieces of this note are 'density-increment' 
proofs of the Multiple Recurrence Theorem 1 1.2 l and the case d = 2 of Theorem 1 1.3 1 
corresponding to Gowers' and Shkredov's combinatorial implementations of the 
density-increment argument respectively. 

The main steps taken by Gowers and Shkredov do have counterparts in these 
proofs, but we need different structural results from within ergodic theory to en- 
able them. These will largely be drawn from recent studies of the 'nonconventional 
ergodic averages' whose limit infima appear in Theorems ll.2l andl l.3l In particular 
we rely on the method of 'characteristic factors', which has emerged through the 
works of several researchers since Furstenberg's original paper PFurVVI, and espe- 
cially on some of the technical steps in Host and Kra's proof (|HK05]) of conver- 
gence for the averages of ll.2l and in the work of Conze and Lesigne [|CL84[|CL88al 
ICLSSb l and the subsequent works f Aus091 lAuslOl on the multi-dimensional case. 
Many other researchers have contributed to this story within ergodic theory, includ- 
ing Rudolph, Zhang, Katznelson, Weiss, Ziegler, Frantzikinakis and Chu, and the 
reader is referred to [Ausa | for a more complete discussion. 

The basic ergodic theoretic version of the density-increment argument for Theo- 
rem l 1.21 will be introduced in S ubsection 12 .2 1 and then used to complete the proof of 
that theorem later in Section |2] Although a density increment is central to Shkre- 
dov's proof as well, he uses it in a slightly more complicated way, and so in Sec- 
tion [3] we introduce the ergodic theoretic analog of this separately and then use it 
to prove the case d = 2 of Theorem 1 1.3 1 

On the one hand, I hope that these proofs shed some light on the nature of the 
density-increment argument. On the other, it seems that recent progress in ergodic 
theory is beginning to address some of the problems of extending this approach to 
give a density-increment proof of the whole of Theorem 11.31 (and so, one might 
hope, also to give a finitary density-increment proof of Theorem 11.41 as requested 
by Gowers). In the final Section|4]we will draw on results from [ AusailAiisbilAuscll 
to sketch some of the further developments suggested by this progress. 
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2. Powers of a single transformation 

2.1. Preliminary discussion. In this section we show how the density-increment 
strategy can be used to give a proof of Theorem 11.21 building on two important 
ergodic-theoretic ingredients. Let us first recall a convenient definition. 

Definition 2.1 (Process). We will refer to a probability-preserving "L-system (X, /i, T) 
together with a distinguished subset A as a process and denote it by {X D A, p,T). 

Definition 2.2. An ergodic process {X D A, /i, T) has no k-APs in its return 
times if 

H{A n r-"A n • • • n t-^^^'^^'^A) = O foralln£Z\ {0}. 

Clearly if ij.{A) > then the above property is stronger than being a coun- 
terexample to the Multiple Recurrence Theorem, which requires that the relevant 
intersections have positive measure on average, not just for a single nonzero n. 
Since that theorem turns out to be true, the above definition is essentially vacuous, 
but it will be a convenient handle at various points during the proofs that follow. 

The first ingredient we need is a corollary of the recent result of Host and 
Kra ||HK05n that the limiting values of the multiple recurrence averages are pre- 
cisely controlled by certain special nikotation factors of a system {X, fx, T). 

Definition 2.3 (Nilrotations). For any k > 1 a k-step nilrotation is a Z-system 
on a homogeneous space G/T for G a k-step nilpotent Lie group and T < G a 
cocompact discrete subgroup, where G/T is endowed with its normalized Haar 
measure m and the transformation is given by 

Rg-.hV^ ghV 

for some g & G. 

Theorem 2.4 (Host-Kra Theorem). For each k > 2, any ergodic Z-system X = 
{X, fi, T) has a factor map ■Kk-2 '■ ^ ~^ onto a system generated by an 

inverse sequence of (k — 2)-step nilrotations such that 

n=l 

1 ^ r 

~ /vE/ E;.(/okfc-2)-(E^(/ikfe-2)or-) (E^(A_i|vrfe_2)or'=")dA. 

n=l •'^ 

as N ^ oo for any /q, /i, . . . , fk-i £ L°°{p,), where the notation asserts that the 
difference between these two sequences of averages tends to as N ^ oo. □ 

Remark. The above result is often expressed by asserting that the factor TTk-2 is 
characteristic for the averages in question. This theorem first appears in [HKOS], 
where its proof invokes a family of seminorms on L°°{p) that Host and Kra intro- 
duce for this purpose and that are closely analogous to Gowers' uniformity semi- 
norms from [GowO l l. so offering another point of proximity between the ergodic 
theoretic and quantitative approaches. Another proof of Theorem 12 .4 l has now been 
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given by Ziegler in IIZie07ll . who also shows that the maximal factor of X generated 
by {k — 2)-step nikotations is also the unique minimal factor that is characteristic 
in the above sense. <l 

With the Host-Kra Theorem in mind, the second result that we use is simply the 
fact that multiple recurrence does hold for nilrotations. 

Theorem 2.5 (Multiple recurrence for nilrotations). If Rg G/T is an ergodic 
nilrotation, A <Z G/T has positive measure and K > \ then there is some r > 1 
such that 

m{g-^'A n g-^^-^^^A n • • • n g^' A) > 0. 

□ 



In fact, this result is considerably simpler than Theorem 12.41 which really does 
the heavy lifting in what follows. The point is that the orbit of the diagonal A := 
{{x,x,...,x) : X G G/T} C (G/r)^^+^ (or rather, its normalized surface 
measure m^) under the off-diagonal transformation R{^g-K g-K+i gX-^ (which is 
clearly still a nilrotation acting on (G/r)^^+^) can be shown to equidstribute in 
some finite union of closed connected nilsubmanifolds of (G/r)^^+^ that contains 
the whole of this diagonal set. This follows from strong results classifying all 
ergodic invariant measures for nilrotations. From this point a fairly elementary 
argument gives the positivity of 



liminf — V mig-^'^A n g-^^-^^^'A n • • • n g^'^A) 

n=l 

1 ^ f 

= ^TJ^^nY. / lAxAx...xAoi?f^_K,,-K+i,„„,K)dmA, 
n=l 

and also the fact that these averages actually converge, so this limit infimum is re- 
ally a limit. A related instance of this argument can be found presented in detail in 
Section 2 of the work HBLLOSII by Bergelson, Leibman and Lesigne, who use it for 
the related end of proving multiple recurrence along certain families of polynomi- 
als. Equidistribution results for nilrotations on which this reasoning can be founded 
are available in either Ziegler HZieOSI or Bergelson, Host and Kra IIBHK05n . which 
in turn build on older works of Parry ||Par69J IPar70J IPar731 . Lesigne l|Les91|| and 
Leibman IILei98[lLei05l . 

With Theorems 12.41 and 12.51 at our disposal, it is relatively easy to lay out a 
density-increment proof of the full Multiple Recurrence Theorem. However, it is 
important to observe right away that this is a rather perverse thing to do, because the 
above two ingredients also imply that theorem through the following even quicker 
argument: 

• given our process {X D A, fi, T), we wish to prove that 

1 ^ 

lim inf — y ^(^ n r-"yi n • • • n t-^^^-^^a) > o, 

n=l 
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SO by Theorem 12.41 it suffices to prove instead that 

n=l "^^ i=0 

with ■Kk-2 '■ X — )■ Zfc_2 the inverse limit of nilrotation factors from that 
theorem (and where we write E(A 1 7rfe_2) as short for ^{1a | tta,— 2)); 
• this, in turn, will follow if we prove that 



1 ^ 

liminf — y n T-'^B n • • • n r-('=^i)"fi) > o 



n=l 

where B := {E{A \ 'Kk-2) > for any positive e chosen so small that 
^{B) > (for example, e < /u(A)/2 will do); 

finally, importing a simple trick from [f K78il . this follows by choosing a 
further factor a : Zk-2 G/F onto a finite-dimensional nilrotation such 
that 

llE(^|7rfc_2) - E(^|ao7rfc_2)||i < 



100(A; + 1) 



(which is possible because 7rk-2 is generated by an inverse sequence of 
such further factors a) and letting 

C := {e{B \ a o 7rk-2) > I - -^}, 
for which we now easily deduced that 



liminf — n r-"5 n • • • n t-^^-^^'^b) 

1 1 ^ 

> - lim inf — y /,(c n T-"c n • • • n r-(fc"i)"c7), 



2 iV 

n=l 

which Theorem 12.5 1 shows is strictly positive. 

(This proof is also essentially that used in HBLLOSII for their instance of polynomial 
recurrence.) 

Therefore the point of this section is not to provide a serious new approach to the 
Multiple Recurrence Theorem, but rather to exhibit the density-increment strategy 
in a setting familiar to ergodic theorists. 

The reason why the approach to multiple recurrence just sketched does not have 
a clear analog among quantitative proofs of Szemeredi's Theorem is hidden in 
our appeal to Theorem 12.41 In fact, the technical result that drives Gowers' work 
is really more analogous to the following easy corollary of Theorem 12.41 than to 
Theorem 12.41 itself : 
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Corollary 2.6. If an ergodic system (X, ^, T) and measurable functions /o, /i, • • • , fk-i 
X —7- [—1,1] are such that 

limsup - ^ / /o • (/i o T") o r(^-i)") > 7 > 0, 

then there is a factor map vr : (X, n, T) — >■ (G/F, m, -Rg) onto a« ergodic {k — 2)- 
nilrotation such that 

||E(/j 1 7r)||2 > ^7 for each i = 0, 1, . . . , - 1. 

/n particular, if {X D yl, /x, T) a process with fJ-{A) > 6 but no k-APs in its 
return times then there is such a factor map tt for which 

\\E{A\^)-^,{A)h>^6^ 

Proof. By Theorem 12.41 the averages in question have the same asymptotic be- 
haviour as the averages 

N 

n=l •'^ 

with f[ := E(/j 1 7rfc_2), and now all these functions still lie in the unit ball of 
and so for any i we can apply the Cauchy-Schwartz inequality to fi o 
r*" and the product of the remaining factors to deduce that the above average is 
bounded in absolute value by ||/,/||2- Since the sum of these averages must be 
greater than 7/2 infinitely often, this requires that ||/,- II2 > 7/2 for each i; finally, 
since 7Tk-2 is generated by further factor maps onto finite-dimensional {k — 2)-step 
nilrotations, letting vr be a large enough one of these gives the first conclusion. 
To derive the second conclusion, first use Theorem l2.4l to obtain 

N N k-1 

- /.(^ n T-'^A n • • • n r-(^-i)M) ^ E / 11 I ""^-2) o dfi 

n=l n=l i=0 

as ^ 00, SO that if A contains no /c-APs in its return times then both of these 
expressions must vanish as — > oo. Now use the decomposition 

E{A 1 7rfc_2) = (E(A I 7Tk-2) - li{A)) + fi{A) 

to form the telescoping sum 

N ,. k-1 



^ J X ■_r^ 

I ^^{Ar■mA\^u-2)oT--^,{A)) 

JX 



n=l i=0 
k-1 ^ N 

E 

i=0 n=l 

• n (E(^kA.-2)°r^'")d/i 

i<j<k-l 

+^i{Af. 
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Since this tends to 0, the first k terms of the sum must asymptotically cancel the 
term fJ.{A)^ > S^, and hence at least one of these first terms must have magni- 
tude arbitrarily close to ^6^ for infinitely many N. The first part of the corollary 
therefore gives some {k — 2)-step nilrotation factor vr for which 

\\E{A\7r)-^^{A)\\2>^6^ 
as required. □ 

Within Roth's and Gowers' works lie quantitative analogs of the above result: 
this is what drives Gowers' proof that a failure of uniformity of degree A; — 2 for a 
density-(5 set C {1, 2, . . . , N} gives a partition of {1, 2, ... , N} into fairly long 
subprogressions on which E enjoys an enlarged relative density (Theorem 18.1 
in HGowOID . 

Heuristically, Gowers shows first that a failure of uniformity of degree k — 2 
implies a nontrivial correlation between 1e — 6 and a function on {1, 2, ... , N} 
which behaves like the exponential of i times a real polynomial of degree {k — 2) 
on many large subprogressions of {1,2,..., N}. He then converts this correlation 
into the desired partition of {1,2,..., N} into long subprogressions. This corre- 
lation with a function that behaves 'locally' like a de.gree-{k — 2) polynomial is 
the analog of having a nontrivial conditional expectation onto a. {k — 2)-step nil- 
system. The exact formulation of the finitary 'inverse theorem' for the failure of 
higher-degree uniformity is rather complicated, and we omit it here, but again a 
gentle introduction with many further references can be found in the book IITV06I 
of Tao and Vu. 

In the infinitary ergodic-theoretic setting the implication of Corollary 12.61 by 
Theorem 12.41 can easily be reversed: given any indicator function 1a we can de- 
compose it as 1a = {1a — E(j4 [ TTk-2)) + E(A | 'Kk~2), and now if we form a 
telescoping sum for the expression ^ J2n=i /"(^ ^ T'^^A n • • • D r"^''"^)"^) 
similar to the above then (the contrapositive of) Corollary |2.6| implies that all the 
terms involving 1a — E.{A \ 7rfc_2) must vanish as — )• oo, leaving us with Theo- 
rem |24l 

N Af „ fc-1 

- ^ /.(^ n T-M n ■ ■ ■ n r-(^-i)M) ~ ^ E / 11 I ^^^-2) o d^u. 

n=l n=l "^^ 1=0 

However, difficulties emerge when one tries to develop a quantitative analog 
of this reverse implication, and so provide a truer analog of Theorem 12.41 in the 
finitary setting. In order to make sense of either conditional expectations such as 
E{A 1 7rfc_2), or of the structure of 11^-2 as an inverse limit of a possibly-infinite 
collection of nilrotation factors, one needs a quantitative analog of taking a limit 
in ^^(/i). In practice this leads to an explosion in the bounds obtained. Although 
something in this vein is possible (see Tao [Tao06a|), in general it is much less effi- 
cient than the density-increment strategy, for which (the finitary analog of) Corol- 
lary |2]6] suffices. On the other hand, our quick presentation above of the deduction 
of multiple recurrence from Theorems I2.4l and 12.5 I clearly uses the full strength of 
Theorem 12.41 and so if instead we started from Corollary 12. 6 1 it would require us 
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to prove Theorem I2.4l first before proceeding as above. In the next subsection we 
will see that the density-increment strategy, by contrast, uses only the conjunction 
of Corollary I2.6l and Theorem 12.51 and it is this feature that accounts for its greater 
efficiency (leading to better bounds) in the finitary world. 

2.2. The density-increment proof. The strategy here is to prove Theorem 11.21 
or ll.3l by 'induction on The technical result underlying this is an ergodic- 

theoretic analog of Proposition 11.51 

Proposition 2.7 (Ergodic-theoretic density-increment). For each k > I there is 
a function Ck '■ (0, 1] — > (0, 1] that is bounded away from on compact subsets 
such that the following holds: if {X D A, n, T) is a process with ijl{A) = 5 > 
but no k-APs in its return times, then for every e > and N > 1 there are some 
non-negligible B C X and integer r > 1 such that 

fiiBAT'^B) < efi{B) 

and 

H{A I T-'^'B) >5 + Ck{6) for all -N <n<N. 

Before proving this result let us see why it implies Theorem II. 21 

Corollary 2.8. With Ck as in Proposition 12. 71 the following holds: if there exists a 
process {X D A^ fi, T) having ^jl{A) = 5 > {) but no k-APs in its return times, 
then then is another process (Y D B,u,S) having ^{B) > 5-\-Ck{6) but no k-APs 
in its return times. 

Proof of Corollary from Proposition lTTl By Proposition 12.71 for any N we can 
find a non-negligible B^ C X and r^v > 1 such that 

/x(S^.AT-S^.) < ^ 

and 

fi{A I r-^^"(S7v)) >6 + Ck{6) for all - < n < A^. 

Let UN be the probability measure on y = {0, 1}^ that is the law of the random 
sequence 

^N-x^ (lA(T'^^"(x)))„gz 

for X drawn at random from fi{ ■ \ Bn) (that is, x is chosen 'uniformly from Bn')- 
Let : y ^ y be the coordinate left-shift and 

Aa := {iuji)iez : = 1} C y for a G Z, 

so Aa = S~"-{Ao). The lower bound on the measures h{A\T-'''^"-{Bn)) for 
—N<n<N implies that any vague accumulation point u of the sequence uj^, 
say u = limj_^oo i^N^, niust satisfy 

u{Aa) = lim fi{A I r^^»'^(5,vj) > S + Ck{5) Va. 
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On the other hand, the assumption that there are no /c-APs in the return times of A 
implies that 

m{Aa n Aa+r H • • • R ^a+(fc-l)r) 

= ^N{{i^i)i& ■ = ^a+r = • • • = iOa+{k-l)r = 1} = 

for all a € Z, r > 1 and all N, and so the same is true for u. 

Finally, the inequality ij,{Bj\[AT^'^Bn) < ijl{Bn)/N implies for any Borel 
C C y that 

\un{C) - un{S-^C)\ = \^l{ip-^^^C\BN)- ^l{T-'''^-^^c\BN)\ 

= \fi{ip],'C\Br,)-fi{^],'C\T'-^BN)\ 
ti{^],'CniBr,AT^^Br,)) 

so the vague limit v is also S'-invariant. Letting B := Aq, this gives a process {Y D 
B,v,S) with no fc-APs in its return times and the desired improved bounds. □ 

Proof of Theorem \L2\ from Corollar\ \2.8\ Step 1 If (X D A, fj, T) is any coun- 
terexample to Theorem II .2 1 with 5q := ii{A) > 0, then a simple vague limit argu- 
ment can enhance it to an example with the same density value 6o but no A;-APs in 
its return times. This construction forms the bulk of this proof. 

We first transfer our initially-given example onto the space Y := {0, 1}^ with 
the left-shift S. Let 

B := {{u)i)iez : = 1}, 
and now consider the map 

f^:X^Y:x^{lA{rix)))^^^. 

This intertwines T with S, so the pushforward ui := [fj^)^fj, is an 5-invariant 
Borel measure on Y for which z^i(-B) = fJ-{A) and 



lim inf — S" MB n s-'^B n ■■■ n s-^^-^^'^B) 

1 ^ 

= lim inf — y n r-"A n • • • n r-^^-^)" A) = o. 



n=l 



This implies that the corresponding averages along any subset a • N C N, a ^ 1, 
also tend subsequentially to zero: 

1 ^ 

lim inf — y z^i n n • • • n = 0, 



N^oo N 

n=l 



because for large N the terms corresponding to n G a • N account for about 1/a of 
the full average. 

Now let Vk be the image measure of ui under the coordinate-dilation 
dilfc -.Y^Y: {uji)i^z ^ {^^ki)iez, 



ERGODIC-THEORETIC DENSITY-INCREMENT 



13 



SO that each Uk is still shift-invariant and satisfies ffc(-B) = i^i{B), since B = 
dil^^{B) because it depends only on the zeroth coordinate. Letting u be any limit 
point of the averages J2k=i in the vague topology, the above convergence 
tells us that 

u{B n s-^B n • • • n s-^'^-^'^'^B) = o 

whenever a ^ 0. 

Step 2 Having made this simplification, Corollary 12.81 gives a new coun- 
terexample with density Si > 6o + Ck{6o). Since is bounded away from on 
the subinterval [Sq, 1] C (0, 1], after finitely many iterations this procedure gives a 
counterexample with density greater than 1, and hence a contradiction. □ 

Before presenting the proof of Proposition 12.71 we need one further enabling 
result, for which we will make our appeal to Theorem 12.51 From that theorem we 
need the consequence that one can approximately decompose an arbitrary positive- 
measure U C G/T into a collection of almost-invariant sets for different powers 

OfRg. 

Proposition 2.9. If Rg r\ {G/T,m) is an ergodic nilrotation, U C G/T is mea- 
surable and of positive measure and K > 1, then there is a countable set of pairs 
{(^1) '"i)) ^2)5 • • •} (which could be finite or infinite) such that 

(i) each Vi has positive measure; 

(ii) ifi 7^ i' then the unions IJ^-x^'^*'^^ ^'^'^ \Jk=-K disjoint; 

(iv) Wm(f/\U>iUf=-i^5'''^^^) =0. 

Proof. The proof given here invokes Zorn's Lemma, although a more careful 
argument shows that it only really needs the ability to induct transfinitely below 
Frastratingly, I have not been able to find a proof that avoids this kind of 
induction entirely, although in the finitary analog of this step all sets are finite and 
so the issue does not arise. 

Let A be the set of all countable families of pairs {(Vi, ri), (V2, r2), . . .} that 
have properties (i-iii) above (but possibly not (iv)), and order A by inclusion of 
families. If J" = {(Vi, ri), (V2, r2), . . .} G ^ then set 

K 

m(^):=m(U U Z^'V^.)- 

i>\k=-K 

Since m{U) > 0, Theorem 12. 5 I promises some r such that 

m{g-^'^U n g-^^-^^'U n • • • n g^'U) > 0. 

Therefore the set V := g~'^'^U D g^^^^^^'^U fl • • • fl g^^'U has positive measure 
and satisfies g'^^V C U for every —K < k < K, so {{V, r)} G A and hence A is 
nonempty. 

Now suppose that {Ta)a is a totally ordered family in A. Since m{V) > 
for any {V, r) G Ta, the values of the measures m{Tct) are totally ordered, are all 
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distinct and are bounded by 1. We may therefore extract a non-decreasing sequence 
C J-"q2 C • • • such that m{Fai) ~^ sup^, m{Ta) as i ^ oo. 
Since each Ta is countable and they are totally ordered, it follows that Q := 
|Jj>i J^Oi is still countable, and in fact is still a member of A. Moreover, if {V, r) G 
Ta for some a, then this pair must actually appear in some , for otherwise we 

would have m{Fai) — "^(-^a) ~ Ufc=-ii' 5''^'^^) for every i, contradicting our 
construction. Hence Q Z) Ta for all a, and so Q is an upper bound for the chain 

Therefore by Zom's Lemma the whole family A has a maximal element, say 
T = {{Vi,ri), {V2,r2), ■ ■ .}. Now we need simply observe that this must have 
m{T) = m{A) (which implies property (iv)), since otherwise another appeal to 
Theorem |23] would give C C/ \ U 7" and r' > 1 such that F U {{V , r')} G A, 
contradicting the maximality of F. Therefore F has all the desired properties, and 
the proof is complete. □ 

Remark. The finitary analog of this result in HGowOU (see his Corollary 5.6) is 
very elementary and quantitative. I suspect that a version of Gowers' proof could 
be adapted to the present setting (perhaps with some additional assumptions on 
U, such as that it be open with piecewise-smooth boundary), but that this would 
require the use of a Mal'cev basis for G and the ability to study orbits of Rg in 
terms of 'explicit' generalized polynomials using the resulting coordinate system. 
Such a more quantitative argument would probably be considerably longer than the 
proof given above. < 

We can now complete the density-increment proof of multiple recurrence using 
the above proposition and Corollary [ 



Proof of Proposition 12. 71 Suppose that {X D A, ji, T) is a process having ^i{A) =: 
6 > but no fc-APs in its return times. Then Corollary 12. 6 1 gives 

\\E{A\7r)-f^{A)h>^6' 

for some factor map vr : {X, /i, T) (G/F, m, Rg) onto a (fc — 2)-step nilrotation. 

Since G/T is compact and its Borel a-algebra is generated by its open sets, we 
can find a finite Borel partition U of G/T into small-diameter positive-measure 
pieces such that 



L-/|E(^|.)-M^|.-'f/)|d™<^J' 



m{U) 



'U 

for all U G U\Uhad, where Z//bad is a subcollection such that m{[J Z^bad) < S''/20k. 
Combined with the preceding inequality, this imphes that there is some U G W for 
which 

Now given > 1 choose K := LN > 1 with L > 1 so large that l/L < 
and L > 2/e + 1. Apply Proposition 12.91 to the set U to obtain pairs 
(^1) '^i). (^2) ^'2)^ ■ ■ ■ with each Vi having positive measure and such that the unions 
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[jf.^_j^ g^^^Vi for i > 1 are pairwise disjoint, all contained in U and together fill 
up m-almost all of U. In view of the convex combination 



i>l 



m{U) 



k=-K 



there is some i for which 
J A 



k=-K 

Letting B' = Ufj-^"' g'^^V, and 

K 

C:= \J g'^'^vXB', 



k=-K 



the shifts C, 5-'''(2JV+i)^^ g-2n(2N+i)(j^ _ _ _ ^ ^^^L-i)n{2N+i)(j pairwise dis- 
joint and contained in (J^-x 9^^^^i' ^^^h has measure at most U^-i^: g^'^^^i) 
and therefore 

A' 



, L-1 
m(i5 ) > — - — m 



k=-K 

In addition, the set difference g^'^ B' \ B' is contained in C and so has measure 
at most 

K 



k=-K 

and a symmetrical argument controls the measure of B' \ g^^B' so together we 
obtain 

2 

m{B'Ag'''B') < m{B') < em{B'). 

L — 1 

Finally, letting B := g-^'^'^B', it follows that ^^'"S C Uf=-x5'''^^i ^r all 
-A^ <n< N, that 



K 



> ml A 



m((:7) 



k=-K 



> 5 + J_5fc_l 
10/c L 

- 20A; ' 

and that B enjoys the same approximate ^'''-invariance as B', completing the proof 
of Proposition O with Ck{6) := -^S''. □ 
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3. Two COMMUTING TRANSFORMATIONS 

3.1. The density increment in higher dimensions. With the appearance of Cow- 
ers' density-increment proof of Szemeredi's Theorem, it became natural to ask 
whether a similar approach can yield improved upper bounds for any cases of the 
multidimensional Szemeredi Theorem. Gowers discusses this question explicitly 
in H GowOO l. It poses significant new challenges, and remains mostly open. For the 
analogous ergodic-theoretic study of multiple recurrence we will see that the dif- 
ficulty arises from the nature of the characteristic factors in multiple dimensions, 
which are rather more complicated than the pro-nilsystems that give the complete 
picture for powers of a single ergodic transformation. 

In the context of finitary proofs, it is still possible to set up a 'directional' vari- 
ant of the norms (actually now just seminorms) that Gowers introduced to define 
uniformity, and to show that the resulting new notion of uniformity does control 
the count of the desired patterns in a subset £' C {1, 2, . . . , A^}'^. The difficulty is 
in handling those sets, or more generally functions / : {1, 2, . . . , [-1, 1], 

which are not uniform in the sense of this seminorm. Extending the approach of 
Roth and Gowers requires one to find the appropriate class of functions against 
which an arbitrary function must see a large correlation if it is not uniform. For 
uniformity of degree k in the one-dimensional setting, these were the functions 
which on many long arithmetic subprogressions of {1,2,... ,A^} agree with the 
exponential of i times some degree-A; real polynomials (see the discussion follow- 
ing Corollary I2.6l l. but in the multi-dimensional setting they are much more com- 
plicated. Part of the difficulty in extending Gowers' approach lies in the problem 
of identifying the most appropriate class of functions to use here, and part of it 
lies in establishing some necessary properties of those functions once they have 
been found (properties which are fairly classical in the case of the one-dimensional 
'local' polynomial functions). 

However, in spite of these difficulties, Gowers-like bounds have now been ob- 
tained in the following special case of Theorem 1 1.4l by Shkredov: 

Theorem 3.1. There is some absolute constant C > such that if 6 > 0, N > 

2^ and A C {1, 2, . . . , A^}^ has \ A\ > SN'^, then A contains a comer: 

A 5 {a, a + rei, a + re2} 

for some a G {1, 2, . . . , N}"^ and r > 1, where ei, e2 are the standard basis 
vectors in 1?. 

In fact, since the appearance of his original article IIShk06bL in IIShk06all Shkre- 
dov has improved the above bound further to the form iP^'^ , effectively by re- 
placing a repeated descent to arithmetic subprogressions with a descent through 
a nested sequence of Bohr sets, following Bourgain's use of these for his im- 
proved bounds in Roth's Theorem IIBou99l IBouOSl . In addition, Shkredov has 
shown in IIShk09l how this latter argument can also be implemented in the setting 
of arbitrary finite Abelian groups (see also Section 5 of Green's survey I.Gre05l 
for a treatment of the case of high-dimensional vector spaces over a finite field). 
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However, for the sake of simplicity this note will focus on analogs of the original 
paper IIShk06bl . and where appropriate make comparisons to the steps taken there. 
Thus, we here present a new proof of the following special case of Theorem 1 1.3 1 

Theorem 3.2. IfTi,T2 : TL r\ (X, [i) commute and A C X has > then 

1 ^ 

lim inf — V n rr" A n T""^) > o. 

n=l 

Henceforth we will generally refer to the quadruple {X, fi,Ti,T2) as a I?- 
system, in reference to the action of the whole group generated by Ti and T2. 

In contrast with our work in the previous section, the analog of Theorem 12.41 
that will appear in this setting does not reduce our study to a class of systems for 
which multiple recurrence can simply be proved directly, as was the case using 
Theorem 12.51 For this reason, although Theorem 13.21 has of course been known 
since Furstenberg and Katznelson's work, the proof presented here is not quite so 
redundant as is the density-increment proof in one dimension (recall the discussion 
following the statement of Theorem 12.51 ). 

An important aspect of Shkredov's proof is the introduction, in addition to E C 
{1,2,..., N}'^, of a superset of it which is a product set Fi XF2 which must also be 
manipulated as the proof proceeds. We will employ a similar idea in the following, 
where for a system {X, fi, Ti , T2) the structure of a 'product set' is replaced by that 
of an intersection of sets which are invariant under either Ti or T2. The importance 
of these special sets corresponds to the emergence of the factor generated by the 
Ti- or T2 -invariant sets within the structure of the characteristic factors. With this 
in mind, we make the following analog of Definition 12. II 

Definition 3.3 (Augmented process). An augmented process is a 1? -system {X, fx, Ti 
together with distinguished measurable subsets A, Ei and E2 satisfying A C 
El n E2 and such that Ei is Ti-invariant. We shall sometimes denote these data by 
{X D EinE2D A,n,Ti,T2). 

Definition 3.4. An augmented process has no comers in its return set if 

n{A n Tf n Tg-^A) = Vn / 0. 

In addition, the following notation will be used throughout the sequel. 

Definition 3.5 (Partially invariant sets). If {X, ij,,Ti,T2) is a I?-system, then a 
subset A <Z X is partially invariant if it is invariant under T"^ for some 
(ni,n2). The a-algebra of (T^^T2^)-invariant measurable sets is denoted by 
5](ni,n.2)^ in addition we let (^^^'"'^^ jy^ some factor map X — )■ ^^"i'"^) ^^^^ 
an auxiliary system where the transformation in direction (ni,n2) is trivial and 
which generates S^"!'"^). 

(This correspondence between globally invariant cj-subalgebras of S and factor 
maps onto other systems is standard in ergodic theory; see, for instance. Chapter 2 
of HAusal and the references given there.) 
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Definition 3.6 (Kronecker factors). If {X, fi, Ti,T2) is an ergodic Z'^ -system then 
(i will denote some choice of a factor map from X onto an action by rotations on 
a compact Abelian group which generates the Kronecker factor of {X, fi, Ti, T2), 
and similarly for Z-systems. 

Definition 3.7 (Arithmetic of factors). Given two factor maps ni : {X, ii,Ti,T2) 
{Yi, Ui, Si^i, S2,i) of a l?-system, we let vri V 112 denote a factor map which gener- 
ates the same a-algebra as vri and 112 together (for example, the Cartesian product 
map (7ri,7r2) : X ^ Yi 'XY2) will do), and vri A 112 denote a factor map which 
generates the a-algebra of all sets that are both vri- <^nd TT2-measurable. 

In his setting, Shkredov considered nested inclusions 



His main innovation is the result that in order to count approximately the number 
of comers in E it suffices to control the non-uniformity of E relative to its superset 
Fi X F2, and crucially to an extent which depends only on the relative density 



He effectively formulated this latter uniformity condition in terms of a uniform 
bound on the one-dimensional Fourier coefficients of the Fi, but for our sets Ei G 
Tj^^ it turns out that a stronger condition is more convenient, formulated in terms 
of the independence of their shifts under Tj for j i; this condition will appear 
shortly. 

The need for the Ei below becomes natural upon understanding the analog of 
Theorem 12.41 for the averages of Theorem 13.21 However, in the ergodic theoretic 
world this involves another new twist, which has no real analog in the finitary 
setting. It turns out that simply-described characteristic factors for the averages 
of Theorem 13.21 may be obtained only after ascending to some extension of the 
initially-given system. (The original system will certainly have characteristic fac- 
tors, but they may be much more complicated to describe.) Of course, it suffices 
to prove multiple recurrence for such an extension, and so this is quite adequate 
for our proof strategy. The following result is specialized from the construction of 
so-called 'pleasant and isotropized extensions' in |Aus091lAuslOII . 

Theorem 3.8. Any 1? -system , ) has an extension 



E C Fi X F2 C {1,2,... 



iV}2. 



\E\ 



, provided the sets Fi and F2 have some uniformity properties of their own. 



]TTPF 



7r:(X,/i,ri,r2)^(X°,^°,T°,T°) 



with the property that 
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as N ^ oo for any /o, /i, /2 G L°°{fi), where 

TTO •- Co VCo 

^ ._ ^(1,0) ^(1,-1) 
TTl .- Co VCo 

7r2 •- Co VCo • 

□ 

Definition 3.9 (Pleasant system). Essentially following the nomenclature of I* AusOQll . 

we will refer to a system having the property of the extension constructed above as 
pleasant. 

Replacing an initially-given Z^-system with an extension if necessary, we may 
henceforth concentrate on pleasant systems. 

With this description of the characteristic factors in hand, we can now offer our 
ergodic theoretic translation of Shkredov's main estimate (Theorem 7 in [,Shk06bJ ). 

Proposition 3.10. Suppose that {X D Ei D E2 D A, ^,Ti,T2) is a pleasant 
augmented process with fJ,{A) > 0, that 

• the return-set of A contains no corners, and 

• El 1. T2{Ei) and E2 -L T]^{E2)for all n 7^ 0, where _L denotes indepen- 
dence, 

and let vro := Co^''^^ V Co'''^'*- Then 

W^M I ^0) - i^{A I El n ^2)||l2(;.(. lEinsa)) > /^(^ I El n ^2)^. 

The benefit of working with the conditions Ei _L T|*(E'i) is that they will be 
relatively easy to recover for the new process that we construct during the coming 
density increment. We will see shortly (Corollary I3.131 l that this condition implies 
that El is orthogonal to the Kronecker factor Ci^, and this orthogonality is a truer 
ergodic-theoretic analog of Shkredov's condition that they be degree- 1 uniformity. 

Proposition 13. 101 will be proved in Subsection 13.31 

3.2. A closer look at the characteristic factors and the main estimate. Before 
proving Proposition 13.101 we need some simple auxiliary results about the factors 
appearing in Theorem 13. 8 1 

Lemma 3.11. If {X, fi,Ti,T2) is ergodic as a Wp'-system, then any two of the 

factors C'o'^\ Co^'^\ Co^' '^f^ independent, and the three together are relatively 
independent over their intersections with the Kronecker factor: 

cfAcr,cfAcr,cfAcr^^ 

Proof. The first assertion is an immediate consequence of the commutativity of Ti 
and T2. We prove it for Co^'°^ and Co'^'^^ the other pairs being similar: since Ti and 
T2 commute, if Ai is Ti-invariant then the conditional expectation E{Ai \ Cq^'^^) 
is invariant under both Ti and T2 and hence constant, by ergodicity, and must 
therefore simply equal 
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Handling the three factors together is only a little trickier. If Ai G Co^'"'*' ^2 G 
Cq^'^^ and A12 € Co^' then by the first assertion the target of the factor map 
Co^'^^ V Co''^^ can simply be identified with a Cartesian product system 

(Yi X Y2, vi (g) V2, S2 X id, id x Si) 

where ^2 is an ergodic transformation of the first coordinate alone and Si an er- 
godic transformation of the second. The fact that the invariant measure of this 
target system is a product z/i (g) z/2 corresponds to the independence of Co^ '^'* and 
Co''^^- In this picture the set Ai is lifted from some subset A'- C Yi under the further 
coordinate projection 11x12—^ ^i- Since A12 is (q^' ^^-measurable one has 

n{Ai n A2 n A12) = / E(Ai n ^2 1 '"^V 1ai2 d/i, 
Jx 

and on Yi x I2 the conditional expectation £(^41 n A2 \ Cq^' is identified with 
the conditional expectation of A[ x A'2 onto the sets invariant under S2^ x Si. 

It is standard that the invariant sets of a product of ergodic systems depend only 
on the product of their Kronecker factors (see, for instance, the more general Theo- 
rem 7. 1 in Furstenberg's original paper IFur771 ). and so our conditional expectation 
of A'l X A'2 is actually onto the invariant sets of Ci^ x (^^^ , whose lifts back up to 

X must all be measurable with respect to (f. Therefore E(Ai n A2 \ Cq^' is 
actually Ci^-measurable, and so the above integral is equal to 

/ E(AinA2|cJ''"'))-E(^i2|CrACi''"'^)dM= / lA,nA,-E{Ai2\CTACi''~'^)dfi. 
Jx Jx 

Applying a symmetric argument to the other sets Ai now shows that this equals 

/ E(^i I Cf A C?'°^) • E(^2 I Cf A C''^) ■ E{Ai2 I Cf A Ci^'-'^) d^, 
Jx 

which is the desired assertion of relative independence. □ 

Remark. The second part of the above lemma, although a very simple consequence 
of classical results in ergodic theory, has an important counterpart in Lemma 1 (4) 
of IIShk06bi It corresponds to the assertion that if sets Fi, F2, -F12 C Z/NZ are 
lifted through the coordinate projections 

(ni,n2) i-^- ni, n2, ni + 722 respectively 

and if in addition they are all linearly uniform (meaning that their Fourier coeffi- 
cients are all small), then their lifts are approximately independent. In his paper 
Shkredov phrases this in terms of the approximate constancy of a certain convolu- 
tion of two functions that are lifted from Z/NIj in this way. <] 

Lemma 3.12. Suppose that {Y, u, S) is an ergodic 'L-system and let Qf : (Y, u, S) 

{Z, mz, R) be its Kronecker factor Then for any f,gG L'^{v), any B C X with 
viB) > that is (^f -measurable, and any e > 0, the set 

{n € Z : \jj-{goS^)dv- E,(/ [ Cf ) • E,(5 o | Cf ) dv\ < e] 
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has density 1 in Z. 

Remark. The conclusion of this lemma may be re-phrased as asserting that 

E.(/ • {g o S") I Cf ) ~ E,(/ I Cf ) • E,{g o 5" | Cf ) 

weakly in L'^{mz) o c L'^{i^) as n — > oo along some full-density subset of Z. 
Strong convergence here for all / and g, rather than weak convergence, would be 
equivalent to {Y, v, S) being relatively weakly mixing over its Kronecker factor, 
which is not always the case. <] 

Proof. On the one hand 

/ f-{goS^)du= [ (/lB)-(<7o5")di. 
Jb Jy 

and on the other E,y(/ Is = E,y(/ | Cf)!^, because i? is already -measurable, 
so after replacing / with fls if necessary it suffices to treat the case B = Y. The 
desired assertion is now simply that 

(/,5o5")~(E(/|Cf),E(5|Cf)o5") 

as n — > oo outside some zero-density set of 'exceptional times' in Z, and this is a 
well-known property of the Kronecker factor (see, for instance, Furstenberg HFurSlD . 

□ 

Corollary 3.13. If {Y, u, S) is an ergodic %-system and E C Y is such that E ± 
S"" {E) for all n ^ ^ then E is independent from the a-algebra generated by (^f 
under fi. 

Proof. The degenerate case B = Y of the preceding lemma shows that asymptoti- 
cally for most n we have 

u{E n 5-"^) « ^ E,{E I cf ) • {E,{E I Cf ) o 5") du. 

Since the Kronecker factor {Z, mz, R) is a compact system, for any e > there 
is some nonempty Bohr set in Z along which the right-hand values above return 
within £ of 

|^E^(i^|Cf)'dz.= ||E^(i?|Cf)||i. 

This Bohr set must have positive density and therefore contain a further subset of 
values of n where our first approximation above is also good. This implies that for 
any e > there are infinitely many n for which 

|z.(i^n5"(i^))-||E^(i^|Cf)||i| <e, 

but on the other hand our assumption on E implies that 

,^{E n S'^iE)) = u{Ef = \\E^{E I cf )||? Vn / 0. 

This is possible only if ||E^(£' [ Cf )||i = I Cf )||2^ which in turn requires 

that E^(£' I cf ) be constant, as required. □ 
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Lemma 3.14. If {X, fi,Ti,T2) is an ergodic I?-system and Ei G T,^^''^\ E2 G 
are such that Ei _L T^^ {Ei) for all n 7^ whenever = {1,2}, then 

also El (resp. E2) is independent from (q^'^^ V Cq^' (resp. Cq^''^'' V Cq^' )■ 

Remark. For us this is analogous to the way Shkredov uses his Lemma 1 to estimate 
the second term in equation (21) in his Theorem 7. <l 

Proof. The second part of Lemma [3. 11 [ implies 

^A^i\Qo VCo ) - ^A^AEi\Qi a Co j Ko V C j- 

Corollary 13. 13l now gives that E^(E'i [ Cj A Co^'*^^) is constant, and hence so is the 
conditional expectation of interest. The proof for E2 is exactly similar. □ 

3.3. The main estimate. 

Proof of Proposition \3. 10\ Define the trilinear form A on L°°{fi)^ by 

1 ^ r 

A(/o, /i, /2) := lim /o • (/i ° ^ ■ (/2 o T^) d^. 

(In fact this is the integral of the function /o (8) /i (8) /2 against a certain three-fold 
self -joining of the system {X, fi, Ti, T2) called the 'Furstenberg self-joining'. We 
will not use that more elaborate formalism here, but refer the reader to HAusal and 
the references given there for a detailed explanation, as well as a proof that the 
limit exists.) 

Our assumptions include that A(^, A,A)=0 (where we have simply written A 
in place of l^i), but on the other hand by Theorem 13. 8 1 we have 

AiA,A,A) = AiE{A\no),A,A) 

= A{E{A\7ro)- fj.{A\ El nE2)lErnE2, A A) 

I El n E2) ■ A{Ei n E2,A, A). 

We now estimate these two terms separately. 

First term Directly from the definition of A we deduce that 

|A(E(A I ^0) - I El n E2)lE^nE2,A A) \ 

< A{\E{A I ^0) - fiiA I El n E2)lE,nE,\,A, A) 

< A{\E{A I ^0) - fi{A I El n E2)lE,nE2lEi n E2, Ei n E2), 

where the second inequality uses that these three functions are non-negative and 
that I A < ^EinE2- Now another appeal to Theorem |3 . 8 1 shows that this last upper 
bound is equal to 

A{\E{A\Tro)-fi{A\EinE2)lE^nE2\, E{Ei n E2\tti), E(Si n ^2 | vra)) . 

From our hypothesis that Ei _L T^(£'i) for all nonzero n and Lemma [3. 141 it 
follows that E2 is 7r2 -measurable whereas Ei is independent from 712, and hence 
that 

E(^in^2|vr2) = ^(^i)1b2, 
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and similarly with the two indices reversed. Given this we can re-write the above 
term as 

/z(£;i)/x(£2)A(|E(A I TTo) - fiiA I El n E2)lE,nE,\,EuE2) 

1 ^ r 

= fi{Ei)ti{E2) Ihn ^V/ \E{A\Tro)-,i{A\EinE2)lE,nE,\ 

■^Tf "(El) ■ Ir2-"(E2) 
= fi{Ei)n{E2) I \E{A\Tro)- fi{A\EinE2)lE^nE2\dn, 

JX 

where for the second equality we have now used that Ei is Tj-invariant and that 

\£{A 1 7ro)-/x(A I Eir\E2)lEinE2HE^-'^E2 = |E(A 1 7ro)-/i(A | £?in£;2)lEinE2l> 

which in turn holds because Ei n E2 is ttq -measurable while A d EiC] E2, so that 
both E(^ I TTo) and lEinE2 are still supported on £Ji n £'2. 

This integral (which no longer involves the triUnear form A) may now be iden- 
tified as 

fi{Ei n E2f\\E{A I vro) - fi{A \ Ei n E2)lE,nE2 IIl^M- I E^nE,)) 

< ix{Ei n E2f\\E{A I vro) - l^{A \ Ei n £2)lEinE2 | EmEa)). 

using the fact that S(i'O) andS(O'i) are independent to write fi{Ei)fx{E2) = /x(-Ein 
E2) and using Holder's inequality for the final upper bound. 

Second term This is much simpler: since A C EiH E2 and E^ is Tj-invariant 
we have 

1 ^ 

A{EinE2,A,A) = lim - V //((Ei n E2) n Tf M n r2-M) 

n=l 



= lim ^ V /[/(rr"Ei n TfM n r^-^Es n ro-"^) 

N-i-oo N 

n=l 

N^oo iV — ' 
n=l 

= J- ^EM^n(r2T-)-M) 



= ||E,(A|ci^'-^)Mi2 



n=l 

i)mi 

2 



(using the Mean Ergodic Theorem for the last equality), and by another appeal to 
Holder's inequahty this is bounded below by 

\\E^{A I '~'^)||? = ti{Af = fi{A I El n E2ffiiEi n ^2)^ 
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Combining the estimates Using the inequalities just obtained in our original 
decomposition of h.{A, A, A) we find that 

= A(^, A, A) > I El n ^2) n 

- \\E{A I vr) - ^x{A I El n ^2)||l2(M- | SinE^)) • /^(^i n ^2)^ 
so re-arranging gives the desired result. □ 

3.4. Shkredov's version of the density increment. We can now present Shkxe- 
dov's main increment result (which corresponds roughly to the conjunction of 
Proposition 12 .7 1 and Corollary 12.8 l in the one-dimensional setting): 

Proposition 3.15. There is a nondecreasing function c : (0, 1] — ?• (0, 1] which is 
bounded away from on compact subsets of (0, 1] and has the following property. 
If {X D El n E2 D A, fi, Ti , Ti) is such that 

(i) f,{A) > 0, 

(ii) the return-set of A contains no nontrivial comers, and 

(iii) El ± T2^{Ei)for all n ^ Q and similarly for E2, 

and if we set 6 := fj,{A \ Ei D E2), then there exists another augmented process 

{X' Z) E'inE'2D A',i^',Ti,T^) 

having the analogous properties ( i-iii) and such that 

fi'{A' \E'inE'2) >S + c{6). 

Remark. Shkredov's argument does not give any effective control over the size 
of the sets E'- in terms of the Ei — in particular, it could happen that they are very 
much smaller — but the point is that this is not needed. <1 

Proof. This breaks naturally into two steps. 

Step 1 Extending {X, fi, Ti, T2) and lifting A and the Ei if necessary, we may 
assume the system is pleasant. Now by Proposition 13.101 conditions (i) and (ii) 
imply that 

||E(^|7ro) n^2)||L2{M-|Sini?2)) > 

and hence there is some non-negligible ttq -measurable set F such that 

fi{A\F) > 6 + 5^/2. 

Moreover, since ttq is generated by Co^ '^^ ^rid Co'^'^^ ^fter approximating this F by 
a disjoint union of intersections of Ti- or T2 -invariant sets we may assume that it 
is itself of the form Fi n F2 for some Fi G cS^'°\ F2 G (S^'^K 

Naively we should hke to replace A C Ei D E2 with A D Fi n F2 C Fi D F2, 
but these sets Fi may not satisfy Fi ± T"Fj for n 7^ 0. We resolve this by another 
conditioning and a vague limit construction. Note at this point that this selection 
of the sets Fi will be responsible for our lack of control over fi'{E'-) in terms of 

Step 2 Let 

X' := ({0,l}x{0,l}x{0,l})^' 
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with its product Borel space structure, let T{,T2 be the two coordinate-shifts on 
this space, and let E[, E2 and A' be the three obvious time-zero cylinder sets of 
X': 

E[ := {(ljI^.lu'I^uj^) e X' : luq = 1} and similarly. 

We will show that for any e > and K > 1 there is a probabihty measure u on 
X' such that 

• is approximately invariant: \iy{C) — z^((T/)~^C)| < e for any C C X' 

and i = 1,2, 

• u{E[ n E'2) > (5-V20)m(Fi n F2), 

• uiE'-ATlE'i) = fori = 1,2, 

• \u{ElA{T^)-''El) - iy{El)^\ < e for all nonzero -K < k < K for 

{i,j} = {l,2},and 

• u{A' I E'^ r\E'2)>6 + 5^/2. 

Given this, we may take a sequence of such measures as £ 4- and K 00 and 
let /x' be a vague limit of some subsequence to obtain an augmented process 

{X'DE[nE'2DA',^l',T[,T^) 

having all the desired properties. The T^'-invariance of follows from the approx- 
imate invariance of the measures u, and the T^-invariance of E'- is only up to a 
/i'-negligible set, but this may then be repaired by replacing E'- with Un(^/)"^^i' 
which differs from E'- only by a /x'-neghgible set. The second of the above points 
ensures that the limit /x' is non-trivial insofar as ^'{A'), fi'{E[ H E'2) > 0. 

Now fix e and K. To obtain such a u, let {Z, m, i?i , i?2) be a compact group ro- 
tation isomorphic to the Kronecker factor of {X, ^, Ti, T2) with factor map Cj^ =: 
( : X Z, and let Ube a. Borel partition of Z into sufficiently small pieces that 

for allU eU\ ZYbad where m{[j ZYbad) < iS^/20)n{Fi n F2). 
Considering the convex combination 

I n = ^ ^^(^i^^^I^M^ n c'u | n n c'u), 

the terms indexed by ^bad must contribute very little (because their sum cannot be 
more than 6^ /2Q if we estimate by simply ignoring the factors of //(AnC^^t/ | Fi n 
F2 n C,~'^U) < 1). Similarly, the terms for which 

^i{Fi n F2 n C^U I Fi n F2) < {5^/2())m{U) 

must also contribute very little (their sum is also less than (5^/20). Therefore there 
must be some U eU\ Wbad for which 

n F2 n c'^u I Fi n F2) > {S^/20)m{u) 

and 

ii{A n c'^u I Fi n F2 n c'^u) >s + 
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Using Bayes' formula, the first of these inequalities implies that 



^i{Fl n F2 1 c'u) 



> (5V20)/i(FinF2). 



KFi n F2) 

m{U) 



Now let F C Z be the Bohr set 

{n G Z : m(Ul\RlU) < £m{U) / 2 and m^UARl^U) < em{U)/2}. 

This is nontiivial because the rotation orbit z i-)- Iz+u is continuous from Z to 
L?{m), and so V has some (perhaps very small) positive density in Z. In view of 
this positive density, Lemma [3. 121 implies that each of the sets 



still has relative density 1 inside V for any A; 7^ and j = 1 or 2, because the 
whole set Z \ Vj^u has density zero. Hence we may choose some r € r > 1 that 
hes in every Vj^k for j = 1, 2 and k G {-K, -K + 1, . . . , K} \ {0}. On the other 
hand, the approximation that defines the members of U\Ubad and the approximate 
return of U to itself under i?" for n G F imply that 



for all nonzero —K < k < K, where the error incurred is at most e/4+e/4 = e/2. 
Now consider the map 



and let v be the image measure • | C ^U) on X'. We will show that this has 

the five desired properties: 

• approximate invariance of u follows from approximate invariance of U 





Lp: X ^ X' -.x^ [lFi{T'2'^''x), lF2(r["ix), lAnFinF2(7'[ 



along V: 



v{C)-u{{Tl)-'C)\ 



Uv-'c\c'u)-i,{Tr'v-'c\c'u)\ 
f,{ip'^cn{C-'UAT[C-'u)) 



< 



e/2 < e 



for any C C X'; 
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• a simple calculation gives 

u{E[nE'2) = fi{FinF2\C'U), 

and this is at least (5^/20)/i(Fi n F2) by our choice of U; 

• similarly, 

u{ElATlEl) = fi{F,ATiFi \ C^U) = 

for i = 1,2; 

• for any nonzero —K<k<Kwe have 

,.{E'i n {T^r'ED = ii{F, n T-^^F, I c^ui 

and by our selection of r this is within e/2 of 

^/^MF.K).E,(r,-F,K)d™, 

which in turn is within e/2 of 

l,{F,\Q-^Uf = v{E';)\ 

giving the required estimate; 

• lastly, our choice of U also guarantees that 

v{A' I E[ n E'2) = ii{A n C^u 1 Fi n F2 n C^U) >5 + 5^4, 

as required. 

This completes the proof with c{5) -.= 5^/4:. □ 

Remark. The two steps above can also be loosely identified with two steps in 
Shkredov's work. The first is similar to the conjunction of Lemma 1 1 and Propo- 
sition 3 in Section 3 of [Shk06b|, whose use appears at the beginning of the proof 
of Theorem 4. The second, rather more involved, amounts to Corollary 1 and the 
various auxiliary results needed to reach it in Section 4 of fShk06bl, which then 
underpin the second step of each increment in the proof of Shkredov's Theorem 
4. < 

Proof of Theorem [O] This now proceeds almost exactly as for Theorem 11.21 

Suppose there exists an augmented process (X D £^1 PI £'2 ^ A, fi, Ti, £2) such 
that > and hence n{A \ Ei n E2) =: > 0, Ei ± Tf{Ei) for all n > 0, 
and for which 

1 ^ 

n=l 

In particular, if {X D ^, ;U, Ti , T2) is a process violating Theorem 13.21 then {X D 
X n X D ^, ;U, Ti , T2) is an augmented process with these properties. 

From these data one can construct another augmented process D Gi n G2 D 
B, V, Si, S2) such that v{B) = ^{A), u{Gi) = fi{Ei) and this new process actu- 
ally has no comers in its return set. This construction proceeds in exact analogy 
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with Step 1 in the proof Theorem 11.31 from Corollary 12.81 the initial process is 
transferred to the symbolic space 



where now the three copies of {0, 1} above the coordinate (ni,n2) receive the 
indicator functions of Ie^ o ^ T^^ ,1e^oT^^ T^^ and 1a o T^' T^^ respectively ; 
and then averaging over dilations constructs a new shift-invariant measure on this 
symbolic space that retains the properties of the original system but actually has no 
comers in its return set. A quick check shows that if Gi , G2 and B denote the one- 
dimensional cylinder sets defined by the three different {0, l}-valued coordinates 
above (0, 0) in Y, then the Gi retain the property of i^-a.s. invariance under Si and 
also the property that Gi _L S^{Gi) for all n / (because the measure i>{Gi n 
Sj'{Gi)) is obtained as an average over m of fi{Ei n T'^"^{Ei)), and these are all 
equal to ii{Ei)'^ = v{Gi)'^ by assumption). 

Now implementing Proposition 13. 151 one can construct from (F D Gi fl G2 D 
B,^, 81,82) a new augmented process {X' D E[ D E'^ D A',^',T{,T2) which 
still has all the properties (i-iii) and for which iJ,{A' \ E[r\E'2) > 6o+c{6o). Since c 
is uniformly positive on [5q, 1], after iterating this construction finitely many times 
we obtain an example of an augmented process for which this relative density is 
greater than 1, a contradiction. □ 

Remark. The above treatment bears comparison with how Shkredov assembles the 
various components of the proof of his main result, Theorem 4, in IIShk06bl . <l 



Theorem l3.2l remains the most elaborate higher-dimensional case of Theorem ! 1.41 
to be successfully proved using a density-increment argument, or to be given bounds 
that improve over the hypergraph-regularity proofs of the general theorem obtained 
in IIGow071 and [NRS06 I. Perhaps the most obvious obstruction to further progress 
is that the various 'inverse theorems' that are known for the relevant notions of uni- 
formity remain incomplete. However, in the ergodic-theoretic world these corre- 
spond to 'characteristic factor' theorems such as Theorem 13. 8 [ and recent work has 
in fact taken these a little further. The following result appears (in a slightly more 
general form) as Theorem 1. 1 in HAuscl . where it is used for a different purpose. 

Theorem 4.1. Any ergodic -system T°, ) admits an ergodic exten- 

sion 



y:=({0,l}x{0,l}x{0,l}) 



I? 



4. Further discussion 



7:■.{X,^l,T^,T2)^{X°,^l\Tl,T^) 



with the property that 



N 



1 



2^/ k■{hoT^)■{f2oT^)■{hoT^T^)d^Ji 





^ / E(/oKo)-(E(/iki)orn-(E(/2k2)oT2").(E(/3K3)oTi"r; 
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in L'^{fi) as N ^ oo for any /o, /i, /2, /s e L'^ifJ'), where 

'w -TT — a(1.0)^(0,1)w^(1,1)waT 
TTO - VTs .- (,0 V (,0 V (,0 V (,2^nii 

>(1,0) , , ^(1,-1) w a(0,1) , , 
VTl = vrs := Co V Co ' V Cd ' V C2,nih 

anJ where Ca^nii denotes a factor generated by an inverse limit of a sequence of 
actions ofl? by two-step nilrotations. 

Once again, these vTj are referred to as the 'characteristic' factors for these mul- 
tiple averages. 

Moreover, a relatively simple extension of Lemma l3. 1 1 I shows that the four fac- 
tors Co^ '^^ Co'^^^' Co^'^^ Co^' that appear above are relatively independent 
over their further intersections with Cjnii ^^^^ Proposition 5.3 in [ AuscD . Theo- 
rem l4.1l and this second result are both known special cases of a general conjecture 
on the joint distributions of partially invariant factors of Z'^-systems, which may 
be found formulated carefully in Section 6 of HAusal and which suggests that an 
inverse theory for all higher-dimensional notions of uniformity generalizing the 
Gowers norms will ultimately be available. 

Theorem 14. 1 l itself bears on the special case of multiple recurrence asserting that 

1 ^ 

^x{A) > ^ lim — y n Tf "A n Tg-^A n Tf "To-"^!) > 0, 

n=l 

which in the finitary world corresponds to finding squares in dense subsets of 1?. 
The above structural results offer hope that some analog of Shkredov's density- 
increment approach may be possible through the study of pleasant augmented pro- 
cesses of the form 

(X D ^1 n ^2 n n D A, Ti, Ta) 

where Ei, E2, E^ and E^ are measurable with respect to Co^'*^^. Co*^'^^' Co^'^^ ^rid 
Cg^' respectively. Of course, more ideas would still be needed to give a new 
density-increment proof of this instance of multiple recurrence, even in the infini- 
tary setting of ergodic theory. For example. Proposition 13. lOl must be replaced with 
some more complicated estimate, and then arguments in the previous section which 
used some conditioning on the Kronecker factor would presumably be replaced by 
conditioning on Cj^nii' which can have much more complicated behaviour. 

Another interesting issue on which ergodic theory can shed some light concerns 
the difference between the problems of proving multiple recurrence for the above 
averages and for the averages 

1 ^ 

— i^i{A n Tf "A n T^^'A n T^^'A) 

n=l 

ansme from a Z^-system {X, ^, Ti, Tg, Ts). We offer only a very informal discus- 
sion of this here, since precise results on these more complex problems are still 
in their infancy. In the finitary world, these latter averages correspond to finding 
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three-dimensional comers in dense subsets of 1? , rather than squares in 1? . Since a 
triple of the form (Ti, T2, T1T2) does formally generate an action of 1?, it is clear 
that multiple recurrence for these Z^-system averages is at least as strong as its 
counterpart for the averages of Theorem 14.11 However, the identification of char- 
acteristic factors for the case of Z'^-systems is also apparently simpler: the main 
result of HAuslOl shows that, after passing to a suitable extension if necessary, one 
has 

1 ^ /■ 

TtE/ /0-(/l°T^r)-(/2°T2").(/3or3")d/. 

N 

-mT. / E(/ok^,)-(E(/i|vri)orn-(E(/2k^)or2").(E(/3K^)or3")d^ 
with 

/ ._ .(1,0,0) .(0,1,0) .(0,0,1) / ._ .(1,0,0) .(1,-1,0) ^(1,0,-1) 

^0 so V C,Q V (,Q , TT^ .— (,Q V C,Q V (,Q , 

-'2 ■■= C^'''' V C^''' V C^'^'-'^ and vr^ := ci^'^-^^ V (^'-'^ V ci°'°'^\ 
and these are the minimal factors with this property. These factors are 'simpler' in 
that they involve only partially-invariant factors, and not compact group rotations 
or nilsystems. The fact that some of the ingredients needed in Theorem 14.11 no 
longer appear here does not contradict the fact that a triple such as (Ti, T2, T1T2) 
generates a Z'^-system, because after passing to a suitable extension the algebraic 
relations among the generators of this Z-^-system will usually be lost. 

It thus appears that the analysis of the more general averages might actually be 
easier, and in fact for deploying some of the methods at our disposal this is true. 
The ergodic theoretic proof of convergence of these averages in | lAus09ll (reproving 
a result of Tao from UTaoOSI ) implicitly needs the linear independence of the group 
elements corresponding to Ti, T2 and T3. In the finitary world, the hypergraph- 
regularity proofs of the multidimensional Szemeredi Theorem must first lift the 
problem into a group Z'^ for d large enough so that one is looking for the comers 
of a d-dimensional simplex (rather than any more complicated (i-dimensional con- 
stellations) before this search can be correctly recast in the language of extremal 
hypergraph theory. 

However, both of these arguments use only the most basic, 'rough' structure 
for the data being studied, and by contrast the more refined density-increment 
approach is simpler in the case of two-dimensional squares than that of three- 
dimensional corners. Each of the superficially-simpler characteristic factors it'- for 
the three-dimensional problem is assembled from ingredients of the form for 
some V € Z^, and each of these is a factor map onto a factor of {X, /i, Ti , T2, T3) 
on which the acting group is essentially Z'^/Zv = Z^ (owing to the partial invari- 
ance). In order to mimic Shkredov's approach to these results, it is then necessary 
to know how all of these essentially two-dimensional systems are jointly distributed 
as factors of {X, /i, Ti , T2 , T3) (in order to generalize our use of Lemma l3.11l in the 
proof of Proposition 13.101 for example). It turns out that to understand this joint 
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distribution one needs the same kind of machinery as for the identification of a 
tuple of characteristic factors in the first place (the reason why these are essen- 
tially equivalent problems is discussed in detail in Chapter 4 of HAusal ): and the 
particular problem of describing the joint distribution of these 'two-dimensional' 
factors turns out to be of a similar level of complexity to the problem of describing 
characteristic factors for multiple recurrence across squares in a Z^-action. 

So the finer information required for the density-increment strategy forces one 
to understand not only the 'top-level' structural result that is contained in the iden- 
tification of a characteristic tuple of factors, but also how all the ingredients ap- 
pearing in those characteristic factors are jointly distributed. This can be of similar 
difficulty to a lower-dimensional problem of identifying characteristic factors. For 
understanding multiple recurrence across translates and dilates of some compli- 
cated constellation in U^, one might need to work with a large partially ordered 
family of factors of a given system, where the characteristic factors appear as the 
maximal elements, and several layers of smaller factors (including group rotations, 
nilsystems, or possibly something else) must also be identified in order to describe 
all the necessary joint distributions well enough to implement a density increment. 
For a density-increment proof such as in Section |3] above, this would presumably 
entail working with a much richer analog of the augmented processes that appear 
there. 

These speculations notwithstanding, serious problems surround the status of 
finitary analogs of Theorem 14. II or its generalizations. I believe such analogs are 
expected by many researchers in this field, but formulating a precise conjecture 
is already tricky, and at this writing I know of no higher-dimensional results be- 
yond Shkredov's. It is not clear what methods (extending Shkredov's or others) are 
needed to establish such structural results. Without them, the prospect of a density- 
increment proof of the presence of prescribed constellations in dense subsets of Z*^ 
seems rather remote. 
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